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SEQUENCE LISTING 

(1) GENERAL INFORMATION 

(i) APPLICANT: Katz, Leonard 

Stassi, Diane L. 
Summers Jr., Richard G. 
Ruan, Xiaoan 
Pereda- Lopez, Ana 
Kakavas, Stephan J. 

(ii) TITLE OF THE INVENTION: NOVEL POLYKETIDE DERIVATIVES 

AND RECOMBINANT METHODS FOR MAKING SAME 

(iii) NUMBER OF SEQUENCES: 34 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Abbott Laboratories 

(B) STREET: 100 Abbott Park Rd. 

(C) CITY: Abbott Park 

(D) STATE: Illinois 

(E) COUNTRY: USA 

(F) ZIP: 60064-3500 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: Fas tSEQ Version 2.0 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 16-MAY-1979 

(C) CLASSIFICATION : 

(vii) PRIOR APPLICATION DATA:' 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 



(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Dianne Casuto 

(B) REGISTRATION NUMBER: P-40,943 

(C) REFERENCE/DOCKET NUMBER: 4 952. US. P2 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: ( 847 ) - 93 8 - 3 137 

(B) TELEFAX: (847) - 938-2623 

(C) TELEX: 
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(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 925 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ 10 NO : 1 : 
O^OC, 0^3.^. ™- -™ 

^S^i'crcrc ™ccc c««™ c.c™ ™co. c.c™ 



GO 
120 
180 
240 

:ioo 

^rCCGOC^CC TCACCCTCCC ———I CCCACCCCCA CACCCTCCAA 420 

CAAAACCTGC CCCCCGGCGG CGCCATGCl cCATCWVCGG CCCCCACGCC 480 

CCCCACCTCG ACAACCACCA CGACACCATC ^CCA ^.^^^cCAACT CAACACCAAA 54 0 

ACCGTCCTCT CCGGCGACCG CACCACCCIC CACCACAICC, ^^^^^^^^^^ CCTCATCAGC 600 

660 
72 0 
7fl0 
H40 

GGCAGGGTGT TCAGCGTGGG CCACTCGGTti x^v...oo.. f^^^^ 

CGTACCTCCC TGCCGACGTA CCCCT 



r^.^^r^nnnrn rrPACACCAC CCTCACCGCC CTCGCCCGCA CCAC-CCTCCC CACCALL.AUC 

™ = ^ = ™ = 



(2) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 103O base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ' ID NO: 2: 
r-rTArrACGG CAGTCCTGCT CACCGGGCAG GGTTCCCAGC GTCAGGGCAT GGGGCGCGAA 

cZt^iS: gg?caccgS gttcgccgcc tcgttcgacg cgatctgcgc tcaactcgac 

rc^C^^Tc ScScSS CAAGGACGTT CTCTTCGCCC CCGAGGGGTC GGAGGACGCC 

ccGC^JS?G SSacgS gttcacacag gcggctctgt tcgccgtgga gacctccctg 
StSggotg? ?cgaggccca cggcctcgtc cccgactacc tcatcggcca ctccatcggc 
nllr^^-Zcl SgSacc? ggccggggtc ctcgatctgg cggacgcgtg cgtcctggtc 
Scccac-S-g gSgccS gcagtcggcc cgggccogcg <.cgcoatgoc cgcggtccag 

r^cScS ACGAGGTACG CGAGGCCCTC GCGACCTTCG ACGATGCGGT TGCCGTGGCC 
GCGAGCGAGG ACGAGGTACG tCCGGCGACG AGGACGCGGT CGAGCGGCTG 

rT™c? g™ca gggcaggSg acgaagcggc tgccggtcag ccacgccttc 
?ScGc?GC acatggacgg gatcgtcgac gagttcgtca ccgccgtctc cgggctcacc 
™gcScc JgaSatccc ggtcgtctcc aacgtcaccg ggaccctcgc caccgtcgac 
cIgSgIcS cgJSgcgta ctgggcacgc cacatccgcg aggccgtgcg cttcgccgac 
gSSgcgS Sctggaggg cgagggcgtc accgaatggc tggagctcgg gcccgacggc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
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900 
960 
1020 



PTTCTCGTCG CCCTGGTCGA GGACTGCCTG GCGAAGGAGG CGGGATCGCT CGCGTCCGCC 

Sgcgcaagg gggcgagcga gccccacacc gtgggcgcgg ccatggcccg cgcggtgctg 
SSga??cg gccccgactg ggcggcggtg ttccccggcg cacggcgggt cgaccttccg ^^^^ 
acgtatgcat 

(2) information for seq id no : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY; linear 



E f 1 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 
ATCTACACST CSGGCACSAC SGGCAAGCCS AAGGG 

(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNES S : s i ng 1 e 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 
CTSAAGGCSG GCGGCGCSTA CGTSCCSATC GACCC 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
CGCGAATTCC TAGGCTGGCG GTGATGTTCA 

(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



35 



35 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 



GCCGGATCCA TGCATACGTC GGCAGGGAGG TAC 

(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
GCTCGAATTC GCTGGTCGCG GTGCACCT 

(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY:, linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 0 : 
GACGGATCCG GCCCTAGGCT GCGCCCGCJCT CCJ 

(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 
TTGGGATCCT ATGCATTCCA GCGCGAGCGC 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 



GAGAAGCTTG GCGCGACTTG CCCGCT 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
YTTTTTAAGC TTGGTACCTG CTCACCGGCA ACACCG 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 
to STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
^TT^TTTGGAT CCCTGCAGCC TAGGGTCGGA GGCACTGCCG GT 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: 
TTTTTTCTGC AGTATGCATT CCAGGGCAAG CGGTTCT 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 
.p^YTTTGAAT TCACGCGTTG CCCGCGGCGT AGGCGC 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15 
GATCGAATTC CCTAGGACGG CAGTCCTGCT CACC 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16 
GATCGGATCC ATGCATACGT CGGAAGGTCG ACCCG 

(2) INFORMATION FOR SEQ ID NO : 17 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 

I 

TTCGAAGAAT TCCCTAGGGT TGCCTTCCTG TTCGAC 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 
TTCGAAAAGC TTATGCATAG ACCGGCAGAT CCACCG 



(2) INFORMATION FOR SEQ ID NO: 19: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

CGGTSAAGTC SAACATCGG 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

GCRATCTCRC CCTGCGARTG 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 44 base pairs 
(D) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21: 

GAGAGAGGAA CCAACGCGCA CGTGATCGTC GAAGAGGCAC CAGC 

■ t 

(2) INFORMATION FOR SEQ ID NO: 22: 

( i ) SEQUENCE CHARACTER ISTICS : 

(A) LENGTH: 4 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
GAGAGAGGAT CCGACCTAGG CGCGGAGGTC ACCGGCGCGA CGGCG 
(2) INFORMATION FOR SEQ ID NO: 23: 



(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 4 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 
GAGAGACCTA GGAAGCCGGT GTTCGTGTTC CCCGGCCAGG GCT 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
GAGAGAGGAT CCGAGGCCGG CCGTGCGCCC GGACCGAAGA CCGCCTC 
(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
GAGAGAATTC CCTAGGGTCG CCTTCGTCTT TCCCGGGCAG G 
(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

TTGAGATCTT ATGCATACGA GGGAAGCGGC ACCCTGC 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 37 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
TTGAGATCTT ATGCATACGA GGGAAGCGGC ACCCTGC 




1=^ 



3 7 



(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
TT(3AGATCTT ATGCATACGA GGGAAGCGGC ACCCTGC 

(2) INFORr4ATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1010 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

GCCGACCGTG TCGTGTTCGT GTTCCCCGGC CAGGGCTCGC AGTGGGCCGG AATGGCCGAG 60 

GGGCTGCTGG AGCGGTCCGG CGCGTTCCGG AGTGCGGCCG ACTCGTGCGA CGCCGCGCTG 120 

CGGCCGTACC TCGGCTGGTC GGTGCTGAGC GTGCTGCGCG GGGAACCGGA CGCGCCCTCG 180 

CTCGACCGGG TCGACGTCGT GCAGCCGGTG CTGTTCACGA TGATGGTCTC GCTCGCGGCG 24 0 

GTCTGGCGTG CGCTGGGGGT GGAACCGGCG GCC^GTCGTCG GGCACTCGCA GGGTGAGATC 3 00 

GCCGCTGCCC ATGTCGCCGG TGCGCTGTCG CTGGACGACT CGGCCCGGAT CGTCGCCCTG 36 0 

CGCAGTCGGG CGTGGCTCGG ACTGGCGGGC AAGGGCGGCA TGGTGGCGGT GCCGATGCCG 4 20 

GCGGAGGAGC TGCGGCCGCG GCTGGTGACG TGGGGGGACC GTCTGGCCGT CGCCGCCGTC 4 80 

AACAGCCCCG GTTCCTGCGC CGTCGCAGGC GACCCGGAGG CGCTGGCCGA ACTGGTGGCG 54 0 

CTGCTGACCG GTGAGGGGGT GCACGCCCGG CCGATCCCCG GCGTCGACAC GGCGGGCCAC 6 00 

TCGCCGCAGG TGGACGCGTT GCGGGCTCAT CTGCTGGAGG TGCTGGCCCC GGTCGCCCCC 660 

CGACCGGCCG ACATCCCGTT CTACTCGACG GTGACCGGCG GGCTGCTGGA CGGCACCGAG 720 

CTGGACGCGA CGTACTGGTA CCGCAACATG CGCGAGCCCG TCGAGTTCGA GCGGGCCACA 780 

CGGGCGCTGA TCGCCGACGG GCACGACGTC TTCCTGGAGA CGAGCCCGCA TCCCATGCTG 84 0 

GCCGTGGCGC TGGAGCAGAC GGTCACCGAC GCCGGCACCG ACGCGGCGGT GCTCGGGACC 900 

CTGCGCCGCC GCCACGGCGG TCCTCGCGCG CTGGCCCTGG CCGTCTGCCG CGCCTTCGCG 96 0 

AGGCGGTCTT CGGTCCGGGC GCACGGCCCG TGGAGTTGCC CACCTATQCG 1010 



(2) INFORMATION FOR SEQ ID NO: 30: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 103 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 



ri i 



H (i) SEQUENCE CHARACTERISTICS: 

O (A) LENGTH: 328 amino acids 

O (B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

Gly Pro Leu Ala Val Met Phe Thr Gly Gin Gly S^r Gin Arg Pro Gly 

15 10 
Met Gly Arg Gin Leu Tyr Glu His Phe Pro Val Phe Ala Gin Ala Leu 

20 25 30 

Asp Glu Val Phe Ala Leu Ala Thr Pro Gly Leu Arg Glu Val Met Phe 

35 40 45 

Asp Pro Asp Gin Ala Glu Thr Leu Gin Arg Thr Asp His Ala Gin He 

50 55 60 

Ala Leu Phe Ala Phe Glu Thr Ala Leu Tyr Arg Leu Trp Glu Ser Trp 
65 70 75 80 

Gly Leu Arg Pro Asp Met Val Cys Gly His Ser Val Gly Glu He Thr 

85 90 95 

Ala Ala His Val Ser Gly Thr Leu Thr Leu Pro Asp Ala Val His Leu 



60 
i20 



CGCGCGCCTG CCTTCGTCTT TCCCGGGCAG GGCGCCCAGT GGGCCGGACT GGGAGCGCGG 
CTCCTCGCGG ACTCCCCCGT CTTCCGCGCC AGGGCCGAGG CATGCGCGCG GGCGCTGGAG 

CCTCACCTCG ACTGGTCGGT CCTCGACGTG CTGGCCGGCG CCCCGGGCAC CCCTCCCATC 3 80 

GACCGGGCCG ACGTGGTGCA GCCGGTGCTG TTCACCACGA TGGTCTCGCT GGCCGCCCTC 240 

TGGGAGGCCC ACGGGGTGCG GCCGGCCGCG GTCGTGGGCC ACTCCCAGGG CGAGGTGGCC 3 00 

GCGGCCTGCG TGGCCGGTGC CCTGTCGCTG GACGACGCTG CCCTGGTGAT CGCCGGACGC 360 

AGCAGGCTGT GGGGGCGGCT GGCCGGGAAC GGCGGGATGC TCGCGGTGAT GGCTCCGGCC 4 20 

GAGCGGATCC GTGAGCTGCT CGAACCATGG CGGCAGCGGA TTTCGGTGGC GGCGGTCAAT 4 80 

GGCCCCGCCT CGGTCACCGT CTCCGGTGAC GCGCTCGCGC TGGAGGAGTT CGGCGCGCGG 54 0 

- CTCTCCGCCG AGGGGGTGCT GCGCTGGCCG CTGCCGGGCG TCGACTTCGC CGGCCACTCG 6 00 

CCGCAGGTGG AGGAGTTCCG CGCTGAGCTC CTGGACCTGC TCTCCGGCGT ACGGCCGGGT 
'^^ CCTTCGCGGA TACCTTTCTT CTCCACCGTG ACGGCGGGTC CTTGCGGCGG CGACCAGCTG 

GACGGGGCGT ACTGGTACCG CAACACGCGC GAACCCGTGG AGTTCGACGC CACGGTCCGG 

GCGCTGCTGC GTGCGGGCCA TCACACGTTC ATCGAGGTCG GTCCGCATCC GCTGCTCAAC 84 0 

W GCPGCGATCG ACGAGATCGC AGCGGACGAG GGGGTAGCGG CCACGGCCCT GCATACGCTC 900 

CAGCGGGGCG CTGGCGGCCT^ TGACCGCGTG CGCAACGCGG TGGGCGCCGC TTTCGCGCAC 960 

^ GGTCTCCGfKJ TCGACTGGAA CGCCCTGTTC GAGGGCACCG GTGCGCGCAG GGTGCCGCTT 102 0 
CCCTCGTACG CCTTC 

(2) INFORMATION FOR SEQ ID NO: 31: 



660 
720 
780 



1035 
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V.1 .h. TH. CI. TKr M.. 01„ «„ ..o «o CX, 

«^ val .U TH, .sp «o „i= TH, u.„ CJn »o „i= X^u «P 

His «P ne s" Xle Ma M. IX. C=X, P.o »is Ma 

Ara Thr Thr Leu His His He Ala Thr Gin 
Thr Val Leu Ser Gly Asp Arg Ttir inr 

MM Lva ihr Aan Trp Leu Am. Val Ser .lis Ala I'he His Ser 
Ltiu Asn Thr Lys inr abii it-v 

p„ «et pro XX. .X„ P.. T,„ Th. T« ... ^.» TK. 

^„ »i. P.O P« „i. TH. P.O ... XX. S« H.t X.U TH. M. 

.H. TH. „X. P.O .SP t". TH. HU T.p TH. .X„ Hi. XX. Ma 
vax .,3 Tvr T.. ".P T« ... Hi, His ..u HXs His Hi, .X, XX. 

T,. .eu Txu XX. CX, P.O »sp Th. Th. ..u T.. Ma X.u Ma 
... P.O T.. TU. ... His ..u XXe P.O ... ^ 

275 ^1 . Ma Tf*n Glv Arq Val Phe 

Asn His Asn Glu Val Arg Ser Thr Asn Glu Ala Leu Gly 

290 ^ r„., Thr Pro Thr Gly Arg 

ser val Gly His Ser Val Asp Trp Arg Ala Leu 

Arg Thr Ser Leu Pro Thr Tyr Pro 
325 

(2) INFORMATION FOR SEQ ID NO:32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 3 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear « 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:32: 
P.O TH. M. vax ..u ... .K. OXv OXn OX, S.. 0X„ 0X„ .Xv 

„L OX, OX. xI. TV. «P »S "o V.X P.. Ma Ma ... Ph. 
..p Ma XX. C^s M. 0X„ ... .J CX, CX„ ... P.o Pro ... .VS 

V.X 'Z. P.. Ma pro ox. ox, ... ox. ..p M. M. ... XX. ..P 

vax Ph. Th. OX. M. Ma ... P.. .l- V.X OX. ... ... - 

'pL ..g ... OX. Ma His OX, ... V.X P.o A.p T,. ... XX. OX, 
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His Ser lie Gly Glu Val Thr Ala Ala His Leu Ala Gly Val Leu Asp 

Leu Ala ASP IH Cys Val Leu Val Ala His Arg Gly Arg Leu Met Gin 

120 

ser Ala Arg Ala Gly Gly Ala Met Ala Ala Val Gin Ala Ser Glu Asp 
Glu III Arg Glu Ala Leu Kll Thr Phe Asp Asp Ala Val Ala Val Ala 
Oil val Asn Gly Pro Isn Ala Thr Val Val Ser Gly Asp Glu Asp Ala 

val Glu Arg Leu Ala Arg Trp Arg gIu Gin Gly Arg Arg Thr Lys 

180 

Arg Leu Pro Val Ser His Ala Phe His Ser Pro His Met Asp Gly He 

val ASP Glu Phe val Thr Ala ITl Ser Gly Leu Thr Phe Arg Ser Pro 

215 220 
Thr^lie pro Val Val Ser Asn Val Thr Gly Thr Leu Ala Thr Val Asp 

"in Leu Thr Ser Pro Ala Tyr Trp Ala Arg His He Arg Glu Ala Val 

Arg Phe Ala Asp G^y Val Arg Tyr Leu Glu Gly Glu Gly Val Thr Glu 

260 265 
Trp Leu Glu Leu Gly Pro Asp Gly Val Leu Val Ala Leu Val Glu Asp 

280 2ob 
cys Leu Vll Lys Glu Ala Gly Ser Leu Ala Ser Ala Leu Arg Lys Gly 

Ala ser Glu Pro His Thr III Gly Ala Ala Met Ala Arg Ala Val Leu 

Arg Gly Ser Gly Pro Asp Trp Ala Ala Val Phe Pro Gly Ala Arg Arg 

Val Asp Leu Pro Thr Tyr Ala 
340 

(2) INFORMATION FOR SEQ ID NO:33: 

(i) SEQUENCE CHARACTERISTICS: , ' 

(A) LENGTH: 344 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 

Ala ASP Arg Val Val Phe Val Phe Pro Gly Gin Gly Ser Gin Trp Ala 

o\y Met Ala Glu G^ly Leu Leu Glu Arg Ser Gly Ala Phe Arg Ser Ala 

20 25 30 

Ala ASP ser Cys Asp Ala Ala Leu Arg Pro Tyr Leu Gly Trp Ser Val 

Leu ser III Leu Arg Gly Glu Pro Asp Ala Pro Ser Leu Asp Arg Val 
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ASP val Val Gin Pro Val Leu Phe Thr Met Met Val Ser Leu Ala Ala 

Val Trp Arg Ala Leu lly Val Glu Pro Ala Ila Val Val Gly His Ser 

Gin Gly Glu lie Tla Ala Ala His Val Tla Gly Ala Leu Ser Leu Asp 

100 105 
ASP ser Ala Arg He Val Ala Leu Arg Ser Arg Ala Trp Leu Gly Leu 

Ala Gly ill Gly Gly Met Val Ala Val Pro Met Pro Ala Glu Glu Leu 

Arg Pro Arg Leu Val Thr ^rp Gly Asp Arg Leu Ala Val Ala Ala Val 

Tsn ser Pro Gly Ser Cys Ala Val Ala Gly ^sp Pro Glu Ala Leu Ala 

165 170 
Glu Leu val Ala Leu Leu Thr Gly Glu Gly Val His Ala Arg Pro He 

pro Gly val Asl Thr Ala Gly His Ser Pro Gin Val Asp Ala Leu Arg 

Ala His III Leu Glu Val Leu III Pro Val Ala Pro Arg Pro Ala Asp 

lie pro Phe Tyr Ser Thr III Thr Gly Gly Leu Leu Asp Gly Thr Glu 



230 



Leu ASP Ala Thr Tyr Trp Tyr Arg Asn Met Arg Glu Pro Val Glu Phe 



Glu Arg Ala Thr Trg Ala Leu He Ala Asp Gly His Asp Val Phe Leu 



Glu Thr ser Pro His Pro Met Leu ^la Val Ala Leu Glu Gin Thr Val 



265 



Thr ASP Tla Gly Thr Asp Ala ICll Val Leu Gly Thr Leu Arg Arg Arg 



280 



285 



295 



300 



His G^ly Gly pro Arg Ala Leu Ala Leu Ala Val Cys Arg Ala Phe Ala 

ITs Gly val Glu val Asp Pro Glu Ala Val IL Gly Pro Gly Ala Arg 

325 

Pro Val Glu Leu Pro Thr Tyr Pro 
340 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
Arg Ala Pro Ala Phe Val Phe Pro Gly Gin Gly Ala Gin Trp Ala Gly 
I.eu Gly Ala Arg Leu Leu Ala Asp Ser Pro Val Phe Arg Ala Arg Ala 
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Glu Ala Cys Ala Arg Ala Leu Glu Pro His Leu Asp Trp Ser Val Leu 

35 40 45 

Asp val Leu Ala Gly Ala Pro Gly Thr Pro Pro lie Asp Arg Ala Asp 

55 ^ ^ 

val III Gin Pro Val Leu Phe Thr Thr Met Val Ser Leu Ala Ala Leu 



65 

Trp Glu Ala His Gly Val Arg Pro Ala Ala Val Val Gly His Ser Gin 



90 95 



Gly Glu val Ala Ila Ala Cys Val Ala Gly Ala Leu Ser Leu Asp Asp 
100 

Ala Ala Leu Val He Ala Gly Arg Ser Arg Leu Trp Gly Arg Leu Ala 

115 120 125 

Gly Asn Gly Gly Met Leu Ala Val Met Ala Pro Ala Glu Arg He Arg 

130 135 140 

Glu Leu Leu Glu Pro Trp Arg Gin Arg lie Ser Val Ala Ala Val Asn 

145 



150 " 155 160 



Gly pro Ala Ser Val Thr Val Ser Gly Asp Ala Leu Ala Leu Glu Glu 

^ Phe Gly Ala Arg III Ser Ala Glu Gly Val Leu Arg Trp Pro Leu Pro 

hj 180 185 150 

m Gly val ASP Phe Ala Gly His Ser Pro Gin Val Glu Glu Phe Arg Ala 

5=; X95 200 205 

m Glu Leu Leu Asp Leu Leu Ser Gly Val Arg Pro Ala Pro Ser Arg He 

— ' n 215 ^ 220 

f pro ?'he Phe Ser Thr Val Thr Ala Gly Pro Cys Gly Gly Asp Gin Leu 

S ASP Gly Ala Tyr Trp Tyr Arg Asn Thr Arg Glu Pro Val Glu Phe Asp 

245 250 
}^ Ala Thr val Arg Ala Leu Leu Arg Ala Gly His His Thr Phe He Glu 

260 265 270 

2 val Gly pro His Pro Leu Leu Asn Ala Ala He Asp Glu He Ala Ala 

Q 275 280 285 

ASP Glu Gly Val Ala Ala Thr Ala Leu His Thr Leu Gin Arg Gly Ala 

290 295 300 

Gly Gly Leu Asp Arg Val Arg Asn Ala Val Gly Ala Ala Phe Ala His 

ITy val Arg Val Asp Trp Asn Ala Leu Phe Glu Gly Thr Gly Ala Arg 

325 330 335 

Arg Val Pro Leu Pro Ser Tyr Ala Phe 
340 345 



